Utilizing failures in question and answer system responses to enhance the accuracy of question and answer systems

ABSTRACT

A computerized device for enhancing the accuracy of a question-answer system is disclosed. The computerized device comprises a question-answer system comprising software for performing a plurality of question answering processes. A receiver receives a question into the question-answer system. A processor that generates a plurality of candidate answers to the question is connected to the question-answer system. The processor determines a confidence score for each of the plurality of candidate answers. The processor evaluates sources of evidence used to generate the plurality of candidate answers. The processor identifies missing information from a corpus of data. The missing information comprises any information that improves a confidence score for a candidate answer. The processor generates at least one follow-on inquiry based on the missing information. A network interface outputs the at least one follow-on inquiry to external sources separate from the question-answer system.

CROSS REFERENCES TO RELATED APPLICATIONS

This application is a Continuation of application Ser. No. 13/563,838,filed on Aug. 1, 2012, which is a Continuation of U.S. application Ser.No. 13/547,492 filed Jul. 12, 2012, which claims the benefit under 35U.S.C. §120 as a non-provisional of U.S. Provisional Patent ApplicationSer. No. 61/508,143 entitled “A METHOD FOR UTILIZING FAILURES INQUESTION AND ANSWER SYSTEM RESPONSES TO ENHANCE THE ACCURACY OF QUESTIONAND ANSWER SYSTEMS”, filed on Jul. 15, 2011, the entire teachings ofwhich are incorporated herein by reference.

In addition, the present application is related to U.S. patentapplication Ser. No. 12/152,411, now issued as U.S. Pat. No. 8,275,803entitled “System and Method for Providing Answers to Questions,” filedon May 14, 2008, U.S. patent application Ser. No. 12/126,642, now issuedas U.S. Pat. No. 8,332,394 entitled “System and Method for ProvidingQuestion and Answers with Deferred Type Evaluation,” filed May 23, 2008,U.S. patent application Ser. No. 12/724,156, (U.S. Patent PublicationNo. 20110125734), entitled, “Questions and Answers Generation,” filedMar. 15, 2010, and to U.S. patent application Ser. No. 13/547,436 (U.S.Patent Publication 20130019285) entitled “Validating That a User IsHuman,” filed Jul. 12, 2012, the entire teachings of each areincorporated herein by reference.

BACKGROUND

The present disclosure relates to question-answer (QA) validationsystems, and more specifically, to a system and method to utilizefailures in system responses to enhance the accuracy of a QA system.

In information retrieval, question answering (QA) is the task ofautomatically answering a question posed to the system. To find theanswer to a question, a QA computer program may use either apre-structured database or a collection of documents, such as a datacorpus such as the World Wide Web or a local database. Question answer(“QA”) systems occasionally produce failures in executing their tasks,such as providing an incorrect answer response to question inputs.Consequently, in order to enhance the efficiency and utility of QAsystems, solutions are required to address these failures adequately.

SUMMARY

Embodiments herein disclose a mechanism for generating questions,presenting a question to a user, and using a response from the user todetermine if the user is human.

Embodiments herein disclose a mechanism in which an automated questionanswering system generates secondary questions, asks those questions toa pool of users, and, using answers to those secondary questions,determines which users are human.

According to an embodiment herein, a method for enhancing the accuracyof a question-answer system is disclosed. Missing information from acorpus of data is identified. The missing information is any informationthat improves a confidence for a candidate answer to a question. Afollow-on inquiry is generated. The follow-on inquiry prompts for themissing information to be provided. The follow-on inquiry is output toan external source. A response to the follow-on inquiry is received fromthe external source. The response is added to the corpus of data.

According to another embodiment herein, a method for enhancing theaccuracy of a question-answer system is disclosed. A question isreceived into a computerized question-answer system operating on acomputerized device to provide a plurality of candidate answers to thequestion. The sources of evidence used to generate the plurality ofcandidate answers are evaluated to identify marginal evidence. Themarginal evidence contributes only partially to a candidate answer. Thecomputerized device determines a confidence score for each of theplurality of candidate answers. The computerized device identifiesinformation not provided by the marginal evidence that could furtherdevelop the confidence score. At least one follow-on inquiry isformulated based on the information. The inquiry is output to externalsources separate from the question-answer system to obtain responses tothe follow-on inquiry. At least one response to the inquiry is receivedfrom the external sources. The at least one response is input into thequestion-answer system. Additional logical rules and additional evidencesources are developed for the computerized question-answer system basedon the at least one response to the at least one follow-on inquiry.

According to another embodiment herein, a method is disclosed. A firstquestion to be answered by a Question Answering (QA) system is providedto a processor. The processor creates a collection of candidate answersto the first question. The collection of candidate answers is createdfrom a corpus of data. The processor generates supporting evidence and aconfidence score for each candidate answer. An evidence analysis moduleproduces a second question based on the supporting evidence. Theprocessor presents the second question to one or more external sourcesseparate from the QA system to obtain responses to the second question.The processor receives at least one response or knowledge item from theone or more external sources. The at least one response or knowledgeitem is input into the corpus of data. Additional logical rules andadditional evidence for the QA system are developed based on the atleast one response or knowledge item.

According to another embodiment herein, another method is disclosed. Aquestion is received into a Question Answering (QA) system. The QAsystem compares the question to a corpus of data. The QA systemgenerates hypotheses about relationships between linguistic and semanticentities of the question and the corpus of data. The QA system generatesa plurality of candidate answers to the question using the hypotheses.The QA system evaluates sources of evidence used to generate theplurality of candidate answers to identify marginal evidence. Themarginal evidence contributes only partially to a candidate answer. TheQA system determines a confidence score for each of the plurality ofcandidate answers. A missing piece of common-sense knowledge that wouldenable the QA system to further develop the confidence score isdetermined. A follow-on inquiry to obtain the missing piece ofcommon-sense knowledge is formulated. The follow-on inquiry is output toan external expert community source separate from the QA system toobtain responses to the follow-on inquiry. Responses comprising themissing piece of common-sense knowledge are received from the externalexpert community source. The missing piece of common-sense knowledge isinput into the QA system. The QA system automatically developsadditional logical rules and additional evidence for the QA system basedon the obtained missing piece of information.

According to another embodiment herein, a computerized device forenhancing the accuracy of a question-answer system is disclosed. Thecomputerized device comprises a question-answer system comprisingsoftware for performing a plurality of question answering processes. Areceiver receives a question into the question-answer system. Aprocessor that generates a plurality of candidate answers to thequestion is connected to the question-answer system. The processordetermines a confidence score for each of the plurality of candidateanswers. The processor evaluates sources of evidence used to generatethe plurality of candidate answers. The processor identifies missinginformation from a corpus of data. The missing information comprises anyinformation that improves a confidence score for a candidate answer. Theprocessor generates at least one follow-on inquiry based on the missinginformation. A network interface outputs the at least one follow-oninquiry to external sources separate from the question-answer system toobtain responses to the at least one follow-on inquiry and receives fromthe external sources at least one response to the at least one follow-oninquiry. The processor inputs the at least one response into the corpusof data.

According to another embodiment herein, a computer system for enhancingthe accuracy of a question answering system is disclosed. The computersystem comprises an automated question answering (QA) system comprisinga corpus of data, a processor, a receiver, and a network interface. Theprocessor has software for performing a plurality of question answeringprocesses. The receiver receives a question into the automated QAsystem. The processor generates a plurality of candidate answers to thequestion. The processor evaluates sources of evidence used to generatethe plurality of candidate answers to identify marginal evidence. Themarginal evidence only partially contributes to a candidate answer. Theprocessor determines a confidence score for each of the plurality ofcandidate answers. The processor identifies information not provided bythe marginal evidence that would enable the QA system to further developthe confidence score. The processor automatically generates at least onefollow-on inquiry based on the information. The network interfaceoutputs the at least one follow-on inquiry to external sources separatefrom the QA system and receives from the external sources at least oneresponse to the at least one follow-on inquiry. The processor inputs theat least one response into the corpus of data. The processorautomatically develops additional logical rules and additional evidencefor the automated QA based on the at least one response to the at leastone follow-on inquiry.

According to another embodiment herein, a question answering (QA) systemis disclosed. The QA system comprises a processor. An evidence analysismodule is connected to the processor. A first interface is connected tothe processor. A second interface is connected to the processor. Acorpus of data is connected to the evidence analysis module. The firstinterface receives a first question to be answered by the QA system. Theprocessor creates a collection of candidate answers to the firstquestion from the corpus of data. Each candidate answer has supportingevidence and a confidence score generated by the processor. The evidenceanalysis module produces a second question based on the supportingevidence. The processor presents the second question through the secondinterface to one or more external sources separate from the QA system toobtain responses to the second question. The processor receives at leastone response or knowledge item from one or more external sources throughthe second interface. The processor inputs the at least one answer orknowledge item into the corpus of data. The processor automaticallydevelops additional logical rules and additional evidence for the QAbased on the at least one response or knowledge item.

According to another embodiment herein, a non-transitory computerreadable storage medium readable by a computerized device is disclosed.The computerized device comprises a question-answer system comprisingquestion answering processes. The non-transitory computer readablestorage medium stores instructions executable by the computerized deviceto perform a method. According to the method, missing information from acorpus of data is identified. The missing information is any informationthat improves a confidence for a candidate answer to a question. Afollow-on inquiry is generated. The follow-on inquiry prompts for themissing information to be provided. The follow-on inquiry is output toan external source. A response to the follow-on inquiry is received fromthe external source. The response is added to the corpus of data.

BRIEF DESCRIPTION OF THE DRAWINGS

The embodiments herein will be better understood from the followingdetailed description with reference to the drawings, which are notnecessarily drawn to scale and in which:

FIG. 1 is a flow diagram of a method according to an embodiment herein;

FIG. 2 is an illustration of aspects of a system according to anembodiment herein;

FIG. 3 is an illustration of aspects of a system according to anembodiment herein;

FIG. 4 is an illustration of aspects of a system according to anembodiment herein;

FIG. 5 is an illustration of inquiry examples according to an embodimentherein;

FIG. 6 is an illustration of aspects of a system according to anembodiment herein;

FIG. 7 is an illustration of aspects of a system according to anembodiment herein;

FIG. 8 is an illustration of aspects of a system according to anembodiment herein;

FIG. 9 is a flow diagram of a method according to an embodiment herein;

FIG. 10 is an illustration of articles of manufacture according toembodiments herein;

FIG. 11 is a schematic diagram of a hardware system according toembodiments herein.

FIG. 12 is a schematic diagram of a deployment system according toembodiments herein;

FIG. 13 is a schematic diagram of an integration system according toembodiments herein;

FIG. 14 is a schematic diagram of an on demand system according toembodiments herein;

FIG. 15 is a schematic diagram of a virtual private network systemaccording to embodiments herein; and

FIG. 16 is a schematic diagram of a virtual private network systemaccording to embodiments herein.

DETAILED DESCRIPTION

It will be readily understood that the embodiments herein, as generallydescribed and illustrated in the figures herein, may be arranged anddesigned in a wide variety of different configurations in addition tothe described embodiments. Thus, the following detailed description ofthe embodiments herein, as represented in the figures, is not intendedto limit the scope of the invention, as defined by the appended claims,but is merely representative of selected embodiments. The followingdescription is intended only by way of example, and simply illustratescertain selected embodiments herein.

The QA system described herein is not merely a document search system.One key difference between QA technology and simple document searchtechnology is that document search typically takes a keyword query andmerely returns a list of documents, ranked in order of relevance to thequery. On the other hand, QA technology takes a question expressed innatural language, seeks to understand it in detail, and returns aprecise answer to the question. A typical scenario is for an end user toenter a question in natural language form, much as if they were askinganother person, and for the system to sift through vast amounts ofpotential evidence to return a ranked list of the most compelling,precise answers. These answers may also include summaries of theirjustifying or supporting evidence, allowing the user to quickly assessthe evidence and select the correct answer.

Given sufficient search data, a document search can usually finddocuments or passages containing some keywords in common with the query,but lack the precision, depth, and understanding necessary to delivercorrect answers with accurate confidence.

While an Internet web search engine has access to an immense source ofinformation and can quickly find relevant web pages given a small numberof query terms, such a search engine does not generate precise answersto the questions. A web search engine does not return answers; rather,it is designed to return a ranked list of web pages that the user may betrying to find.

A challenge for even finding the relevant documents is the issue ofchoosing the right set of keywords to retrieve those documents. Manyquestions contain information that is not critical for answering thequestion, but is provided for educational and/or entertainment purposes.Moreover, the question may use terms different from those used inevidence that may contain a correct answer. As a result, formulating aneffective query that returns the relevant documents is a critical andnon-trivial task.

In some cases, the question requires an answer that must be synthesizedfrom multiple pieces of evidence from multiple source documents; thatis, the answer is a list of items or a logical combination of two ormore items. These answers do not appear in any one place. Rather, the QAtechnology must synthesize the answer from independent sources to form afinal answer.

Classic knowledge-based artificial intelligence approaches to QA systemstry to logically prove an answer is correct from a logical encoding ofthe question and all the domain knowledge required to answer it. Suchapproaches are stymied by the prohibitive time and manual effortrequired to acquire massive volumes of knowledge and formally encode itas logical formulas accessible to computer algorithms, and thedifficulty of understanding natural language questions well enough toexploit such formal encodings if available. Consequently, QA systemstend to falter in terms of breadth, but when they succeed, they are veryprecise.

Aspects of the various embodiments implement the use of failures in QAsystems to motivate inquiries for common sense knowledge, and employcrowd sourcing to respond to those inquiries. These failures can be usedin QA processes to determine what common-sense knowledge would be usefulto have in order to improve answer retrieval. One aspect of the QAsystem is to be able to discover and pose follow-on inquiries to a user(or an external expert community) that, if answered, will improve theability of the QA system to understand and evaluate supporting evidencefor questions. Additionally, the acquired common-sense knowledge can beapplied either off-line or during a live question answering session.Furthermore, specific QA process failures can be identified and used tospecify specific gaps in the system's common-sense knowledge, which canthen be filled. Moreover, specific user-interface elements forcommon-sense knowledge questions are provided and employed.

According to embodiments herein, when the QA system is unable to obtainan answer with a high degree of confidence, the QA system generates andposes follow-on inquiries in order to acquire information that canchange the degree of confidence in an answer. The QA system solicitsresponses to the follow-on inquiries from external sources either inonline mode, in which case the external sources' response to thefollow-on inquiry will influence the QA system's answer to the currentquestion, or in offline mode, where follow-on inquiries are dispatchedto an external expert community at a later time.

It is often the case that the information need is not well captured bythe QA system, as the question processing part may fail to classify thequestion properly or the information needed for extracting andgenerating the answer is not easily retrieved. In such cases, the QAsystem may reformulate the question or request assistance from anoutside source.

The use of an external source of information to obtain information isknown in the art as crowd sourcing. Crowd sourcing is a process thatinvolves sending tasks to a distributed group of people. In crowdsourcing, the task or problem is typically sent to an undefined publicrather than a specific body. Using failures of the QA system to motivateinquiries for common sense knowledge and employing crowd sourcing torespond to those inquiries, embodiments herein can evaluate agreementamong the various respondents regarding responses to the inquiries inorder to validate that the respondents are human and that the responseswere not generated by an automated script.

In some embodiments, a question may be related to an interrogatorysearching for specific information, such as, “What is the capital of NewYork?” In some embodiments, a question may be posed in the form of ananswer or statement, such as done in the popular game show “Jeopardy!”

FIG. 1 is a flow diagram illustrating one non-limiting embodimentherein. According to FIG. 1, one embodiment described herein enables acomputer implemented method for enhancing the accuracy of aquestion-answer system in providing responses to questions inputted tothe question-answer system.

The first step of the method, at 110, involves providing the QA system.The QA system uses multiple question answering processes and evidencefrom a corpus of data to provide answers to questions. The detailsregarding processes and evidence for an example QA system that can beused in a preferred embodiment are described in co-pending U.S. patentapplication Ser. No. 12/152,411, incorporated herein by reference.

In the next step of the method, at 120, a question is input to the QAsystem. The question may be in the form of a statement or an answerseeking an appropriate question.

Next, at 130, a failure in a question answering process is determined.The QA system generates one or more candidate answers to the questionwith associated confidence scores based on results from scoringprocesses/algorithms for pieces of evidence extracted from a corpus ofdata. A failure may result either from an inability to generate acandidate answer with a confidence score above a threshold value or ifthe QA system cannot correctly interpret the question. Additionally, afailure may also result from an individual piece of evidence receiving ascore below a threshold value.

At 140, the failure is used to determine a missing piece of information.The missing piece of data/information may be data/information that wouldenable the QA system to improve a score for a piece of evidence, forexample a passage, wherein the score for the piece of evidence is usedin a confidence score for a candidate answer. The missing informationmay be any item of data, a fact, a syntactical relationship, agrammatical relationship, a logical rule, a taxonomy rule, a grammaticalrule, or any other information that would increase a determined scorefor a piece of evidence that may support or refute a candidate answer tothe question.

Next, at 150, a follow-on inquiry is output to obtain the missing pieceof information. The inquiry may be directed to outside sources that caninclude a variety of users in an expert community who may be human usersor may be other electronic systems capable of providing a response, suchas other QA systems. A follow-on inquiry may involve, for example,keyword matching, expansion of the original question, and/or a requestfor lexical semantic relationships. For example, the QA system mightrequest a clarification of what sense a word is being used, or what typeof information is being requested in the question.

At 160, the QA system receives a response to the follow-on inquiry. Theresponse is returned by a human user, expert community or other QAsystem. At 170, the response to the follow-on inquiry is validated toconfirm the missing piece of data. The validation may include validationthat the response is supported by a threshold number of experts, humans,or QA systems.

Finally, at 180, the obtained missing piece of data is added into thequestion-answer system. Again, the missing piece of data may any item ofdata, a fact, a syntactical relationship, a grammatical relationship, alogical rule, a taxonomy rule, a grammatical rule, or any otherinformation that would increase a determined score for a piece ofevidence that may support or refute a candidate answer to the question.The missing piece of data may be input into the corpus, algorithm,process, logical rule, or any other location or combination thereofwherein the data may affect the resulting score for a piece of evidence.

In some embodiments, at least one step of the method is carried outusing a computer device. In some embodiments, the response to thefollow-on inquiry may be evaluated to determine whether or not theresponse was provided by a human respondent.

Referring to FIG. 2, a general workflow of the QA system 210 is shown.The QA system 210 receives a question or questions 220 and searches acorpus of data 225 to extract pieces of evidence 230,240,250 that areevaluated by a number of algorithms in order to generate candidateanswers 285 to the question 220. The QA system 210 attempts to generatemany candidate answers 285 to the question 220 presented. To do so, theQA system 210 considers a wide variety of information, e.g., texts ofall kinds, web documents, reference books, novels, plays, encyclopedias,dictionaries, thesauri, textbooks, and technical reports, included inthe corpus of data 225.

The corpus of data 225 includes quantities of digitally encoded,structured and unstructured information, such as fragments of documents,passages, internal or external knowledge bases, natural languagedocuments, corporate intranets, reference books, textbooks, technicalreports, blogs, etc. These lists are intended to be non-limitingexamples of sources of information for the QA system 210. The vastmajority of the corpus of data 225 comprises a wide variety of naturallanguage text. Some of the data may be structured (formal knowledgebases) and some of the data may be semi-structured (tagged text) orunstructured (untagged text) in order to help interpret text and refineanswers.

As used herein, ‘pieces of evidence’ refers to the specific informationor data that the QA system 210 evaluates to refute or support candidateanswers 285 to the question 220. The QA system 210 draws pieces ofevidence from the corpus of data 225. The term piece of evidence is notintended to limit the length of evidence used and may include adocument, passage from a document, data, meta-data, entry in a databaseor any other information capable of contributing to a confidence scorefor a candidate answer. Additionally, as used herein, ‘logical rules’refer to the lexical, grammatical, and/or semantic relationships thatthe QA system 210 uses to generate and score candidate answers 285 tothe question 220 based on the pieces of evidence. The logical rulesenable the QA system 210 to draw an inference from the pieces ofevidence in order to evaluate the candidate answers 285.

In trying to generate candidate answers 285, the QA system 210 extractspieces of evidence from the corpus of data 225. The pieces of evidence,based on the question, can be fit into three categories: good evidence230, bad evidence 240, and marginal evidence 250. The QA system 210compares the question 220 to the data in the corpus of data 225 andassigns an evidence score to the extracted evidence based on how wellthe evidence matches the question 220. Sometimes there is data in thecorpus of data 225 that clearly and directly relates to the inputquestion. This is referred to as “good” evidence. Good evidence 230 is apiece of evidence with a high evidence score; that is, an evidence scoreabove a previously established evidence threshold value such that itcontributes significantly to a confidence score for a candidate answer.Conversely, bad evidence 240 is a piece of evidence with such lowrelevance that it does not contribute to a confidence score for acandidate answer. A QA system 210 may not extract bad evidence from thecorpus, or may extract bad evidence in order to evaluate and determineoutright that this evidence will not provide a candidate answer with anyconfidence. Lastly, there are also pieces of evidence that the QA system210 is able to identify as relevant to the question and may produce anevidence score that contributes significantly to a confidence score fora candidate answer if a missing link or missing piece of information isprovided. A missing link or missing piece of information could bedescribed as all the marginal evidence, or a subset of the marginalevidence (either all relatively high scoring marginal evidence, allrelatively low scoring marginal evidence, or some middle range of scoresfor the marginal evidence (based on predetermined score parameters)).Thus, if a piece of marginal evidence falls within some predeterminedscore range, it would be considered a piece of missing information, forwhich additional information will be obtained. In other words, themarginal evidence 250 is evidence that the QA system 210 can use ingenerating candidate answers 285, but is lacking in some manner, suchthat the score for the piece of evidence does not contributesignificantly to the overall confidence score for a candidate answer285. A low confidence score for all candidate answers 285 indicates thatthe QA system 210 should obtain additional information or assistance tofurther develop the confidence scores of the candidate answers 285. Inother words, low confidence scores may serve as a trigger for the QAsystem 210 to evaluate whether pieces of evidence 230, 240, 250 for acandidate answer 285 are marginal evidence 250 such that a missinginformation or missing link may be identified.

The QA system 210 uses the good evidence 230 and the marginal evidence250 to generate candidate answers 285. Each candidate answer 285 isassigned a confidence score, which is an aggregation of the evidencescores. A candidate answer 285 with a low confidence scores; that is,confidence scored below a previously established confidence thresholdvalue indicates that the QA system 210 should obtain additionalinformation or assistance to further develop the confidence scores ofthe candidate answers 285. In other words, bad evidence does notcontribute to the candidate answers 285, marginal evidence contributesonly partially to the candidate answers 285, and good evidence provideshigh confidence in the candidate answers 285.

For a candidate answer generated, the QA system 210 evaluates aself-assessed probability that the candidate answer 285 is a correctanswer to the question 220, or in other words, the QA system 210generates a confidence score for a candidate answer 285. In order togenerate candidate answers 285, the QA system 210 analyzes the questionto figure out what is being asked. It also analyzes the corpus of data225 based on the question 220 in order to extract candidate answers 285,and computes a reliable confidence in light of whatever supporting orrefuting pieces of evidence 230, 240, 250 it finds. To obtain confidencein a candidate answer 285, the QA system 210 attempts to justify theanswer by multiple sources in the corpus of data 225. The degree towhich the probability that the candidate answer 285 is a correct answerto the question 220 correlates to the confidence score for the candidateanswer 285. The QA system 210 can then rank the candidate answers 285based on the confidence scores.

The QA system 210 generates follow-on inquiries 260 to supplement themarginal evidence 250 by identifying an aspect of the piece of evidencethat is lacking, in other words, identifying the missing piece ofinformation or missing link. The QA system 210 uses follow-on inquiries260 to clarify the aspect of the evidence that the QA system 210 failedto understand. According to an embodiment herein, the follow-oninquiries 260 are sent to an external source 272, which may include acommunity of potential respondents 270, which may be on an electronicnetwork, such as the Internet 275. After the responses 280 to thefollow-on inquiries have been evaluated, the validated responses maythen be fed back into the QA system 210 as data, a fact, a syntacticalrelationship, a grammatical relationship, a logical rule, a taxonomyrule, a grammatical rule, or any other information that would increase adetermined score for a piece of evidence that may support or refute acandidate answer to the question. The potential respondents 270 may befrom an external expert community who may be human users or otherelectronic systems capable of providing a response, such as other QAsystems. The responses 280 to the follow-on inquiries may include themissing information or missing link necessary to make the marginalevidence 250 useful in order to improve the confidence scores of one ormore candidate answers 285. Such missing information can be used toproduce improved candidate answers 285 to the inputted question or canbe used to produce subsequent follow-on inquiries 260. In either case,the information can be used to enhance the internal knowledge base ofthe QA system 210 and improve the confidence in the candidate answers285. The missing information included in the responses 280 to thefollow-on inquiries can also help repair flaws in the question answeringprocesses used by the QA system 210 by incorporating the information ofthe response, and the manner that provided the response, into thequestion answering processes.

In some embodiments herein, the QA system 210 evaluates the responses280 to the follow-on inquiries to determine agreement among the variousrespondents regarding the responses 280 to the follow-on inquiries, inorder to validate that the respondents are human and that the responseswere not generated by an automated script.

Inputs to the QA system 210 may include questions 220, the evidenceextracted from the corpus of data 225, and responses 280 to follow-oninquiries. Outputs from the QA system 210 include candidate answers 285and follow-on inquiries 260. This provides low cost, continuousknowledge acquisition.

FIG. 3 shows one approach to identifying marginal evidence 250. The QAsystem 210 receives a question 320 and parses the question into acollection of elements 340. The QA system 210 searches the corpus ofdata 225 for a piece of evidence, for example a passage, such as 330, tosupport or refute a candidate answer 285 to the question. The QA system210 parses the passage 330 into a collection of elements 350. The QAsystem 210 analyzes the passage 330 by seeking to match, or associate insome manner, the elements 340 in the question 320 and the elements 350in the passage 330 based on lexical, grammatical, and/or semanticrelationships. The QA system 210 may, for example, attempt to match thevarious individual elements, such as 345, 355 with correspondingsemantic roles in the question 320 and the passage 330. The QA system210 then classifies the passage as good, bad, or marginal as indicatedat 360. As used herein, ‘marginal’ means that some part of the passage330 may be relevant, but it needs additional information to improve ascore for the piece of evidence based on a scoring algorithm or process.This additional information can be labeled as a “missing link” or“missing information” that, when connected or combined with the otherinformation in the passage 330, enables the QA system 210 to furtherdevelop the confidence score of the candidate answers 285.

In the approach illustrated in FIG. 3, the inputs to the QA system 210include the question 320 and the passage 330 that is used for comparisonto the question 320. Outputs from the QA system 210 includeclassification of the passage 330 into good, bad, and marginal matchesto the question 320, or simply identification of the marginal evidencewithout requiring identification of the good or bad evidence. Themarginal evidence includes missing information thus follow-on inquiries260 can be generated in order provide the missing information from anexternal source.

The QA system 210 may employ a variety of specific mechanisms forfinding marginal evidence. Aligning questions to sample passages is oneof the ways that the QA system 210 uses evidence to evaluate candidateanswers 285. Given an answer key, it can be used to drive the search fornear misses. In general, there are many kinds of “near miss” failuresthat the QA system 210 could encounter, for example:

-   -   Answer typing: the system could have determined that the answer        had the correct type if only it had known that two types were        equivalent    -   Keyword search: the system could have found a relevant passage        using a keyword search if only it had expanded some keyword in        the clue; e.g., by synonyms, antonyms, related terms, etc.    -   Relation detection for knowledge-base lookup: the system could        have looked up the answer in a knowledge base if only it had        figured out what relationship the question was asking for.        Each kind of near miss can imply some kind of semantic inquiry.

In some embodiments, mismatched elements, such as 345, 355, may actuallybe equivalent or have a counterpart as a logically necessaryconsequence. A follow-on inquiry 260 can be used to determine whetherthe mismatched elements are equivalent. Such follow-on inquiry 260 canuse the equivalence to decide if the passage 330 aligns with thequestion 320, and can also be used to resolve similar alignment problemsin future questions.

The QA system 210 analyzes lexical, grammatical, and semanticrelationships of the words within each question 320; therefore, the QAsystem 210 breaks each question 320 down into many separate elements 340(including keywords, equivalent keywords, logical relationships,restrictions, subject categories, etc.). For example, the good evidence230 matches a sufficient number of the elements to provide high scorefor the piece of evidence; that is, a score above a threshold value thatcontributes significantly to a confidence score for a candidate answer285. For example, the marginal evidence 250 matches some of the elementsof the question 320, but not all such elements resulting in a score forthe piece of evidence below a previously established threshold value.The systems and methods herein identify the elements of the question 320that the marginal evidence elements were not able to match. Thus, thesystems and methods herein identify the portions of the question 320that were not provided by the marginal evidence 250, and generatefollow-on inquiries 260 to request the external sources 272 to supplyinformation to clarify the portions of the question 320 that were notprovided by the marginal evidence 250. In some embodiments, thefollow-on inquiries 260 may use a natural language format.

In other words, the systems and methods herein automatically identifythe missing link that, if known, would increase the confidence score forone or more candidate answers 285. This missing link can be an item ofdata, a fact, a syntactical relationship, a grammatical relationship, alogical rule, a taxonomy rule, a grammatical rule, etc. By generatingthe follow-on inquiry, systems and methods herein can receive a numberof responses (from, for example, a crowd sourcing environment). Theseresponses can be filtered to make sure that the responses are fromactual humans (as opposed to computerized answering systems), forexample, based on responses to supply missing information that a machinewould have trouble answering. Additionally, the responses can befiltered to make sure that the responses are above a certain level ofaccuracy (based upon the reputation of the responder, correlation withthe known correct answer, agreement of the responses with one another,etc.). Then, once the responses are filtered, the high-quality,human-based responses can be utilized to increase of the knowledge baseof the question answer system, and/or also used to generate additionalrules to help the question answer system score and rank the candidateanswers 285.

FIG. 4 demonstrates a practical application of the approach to identifymarginal evidence illustrated in FIG. 3. In FIG. 4, the QA system 210compares the question 420 to the passage 430. In the example of FIG. 4,the question 420 is “Eleazar killed one of these huge beasts.” In otherwords, the question 420 is looking for an answer as to what type ofbeast was killed by Eleazar. In other aspects of the QA system 210 notshown, the piece of evidence, or passage 430, has been identified asrelevant to answering the question 420. In performing the analysisbetween the passage 430 and the question 420, the QA system 210evaluates the text for lexical, grammatical, and/or semanticrelationships among the elements 440 of the question 420 and theelements 450 of the passage 430. In the example of FIG. 4, the QA system210 can match some elements, such as Eleazar 445, to conclude that thepassage is of some relevance. However, the QA system 210 may not be ableto determine that “stabbed” 457 and “killed” 447 have a logicalrelationship. This failure of the QA system 210 reveals the missinginformation or missing link 460. In this example, the missing link is toresolve whether “killed” 447 is logically related to “stabbed” 457.Therefore, in this example, the systems and methods herein couldgenerate a follow-on inquiry 260 that asks for a crowd source responseto the question “does stabbing imply killing?” If the response 280 tothis follow-on inquiry is to the affirmative, the systems and methodsherein can automatically learn that the two words may be synonyms.According to an embodiment herein, the system concludes that the passage430 would receive a higher score and contribute more significantly tothe confidence score for a candidate answer 285 of “elephant” if“killed” is logically related to “stabbed.”

According to an embodiment herein, FIG. 5 provides examples of somespecific examples of follow-on inquiries 260. As illustrated in Example1 of FIG. 5, natural language follow-on inquiries 510 are actual naturallanguage questions generated by the QA system 210 that can be supplieddirectly to external sources 272, such as another QA system, anotherinformation source, or human respondents 270 on an electronic network(such as the Internet 275), so that the QA system 210 can acquire themissing information needed to better respond to the question 220.

As illustrated in Example 2 of FIG. 5, given the context of the questionand passage, the follow-on inquiries can be multiple-choice taxonomiesusing text and/or diagrams 515. As would be known in the relevant art,synonyms are different words with similar or almost identical meanings,a hyponym is a word or phrase whose semantic field is included withinthat of another word; for example: rose is a hyponym of flower, and ahypernym is a word with a general meaning that has basically the samemeaning of a more specific word; for example, the word “color” is ahypernym for red, green, blue, brown, etc.

By acquiring responses in a crowd sourcing manner, the systems andmethods herein allow the QA system 210 to answer questions moresimilarly to a human expert and to be customized for differentlocalities. More specifically (with reference to answering the questionof FIG. 4), purely logical rules may require that the correct answer inthe multiple-choice example 515 be selection “D” because “stab” and“kill” sometimes overlap, but not always (surely logically eliminatinganswers A-C). However, depending upon local speech customs, “killing”and “stabbing” can be considered synonyms. By automatically learningthat “killing” and “stabbing” are considered synonyms in a sufficientnumber of situations, the systems and methods herein allow the QA system210 to answer questions with higher confidence, and provides moreflexibility in its word definitions and grammatical/logical rules. Theability of the systems and methods herein to automatically learn ofhuman peculiarities by utilizing crowd sourced responses allows the QAsystem 210 to respond more consistently as a human would when generatinganswers to questions. This further underscores the need to filter outnon-human responses 280 to the follow-on inquiry because non-humanresponses may give the correct, albeit purely logical response, ifsupplied by a machine-based answering system. The QA system 210 hereinis able to receive information a human would consider to be a correctresponse (even if such a response might not be the most correct answerin a purely logical sense).

According to embodiments herein, the QA system 210 can receive a numberof responses, for example, from a crowd sourcing environment or anexternal expert community. These responses can be filtered to make surethat the responses are from actual humans, as opposed to computerizedanswering systems, and that the responses are above a certain level ofaccuracy either based upon the reputation of the responder, correlationwith the known correct answer, or agreement of the responses with oneanother.

An additional example of using follow-on inquiries is described below:

-   Question: This star of “Emmerdale Farm” also plays Elvis in “If You    Ever Get to Heaven.”

In this example, the question is looking for a “star”; however, the QAsystem 210 may not know if that means a star in the sky or a star on thestage. An appropriate answer to the question is “Samuel Kane”. It ispossible that a first source from the corpus of data identifies SamuelKane as an actor. However, the QA system may not be able to determinethat Samuel Kane is a “star.” As such, the QA system has identifiedmissing information or a missing link regarding the lexical relationshipbetween an actor and a star. In an embodiment herein, the QA system 210may direct some follow-on inquiries. Some examples of follow-oninquiries may include:

Is every actor a star?

Is every star an actor?

Are most actors also stars?

Are many actors also stars?

If I know that some entity is a star, would I suspect that this entityis an actor?

If I know that some entity is an actor, would I suspect that this entityis a star?

If I know that some entity is an actor, can I be certain that thisentity is a star?

Using follow-on inquiries, the QA system 210 tries to determine moreabout being an actor and being a star. By evaluating the responses tothe follow-on inquiries, the QA system 210 determines that this questionis asking for a “star.” If the QA system 210 finds a possible answerthat is an actor, can it be certain that this answer is a star?Conversely, if the QA system 210 finds a possible answer that is anactor, should it suspect that this answer is a star? According to oneresource, the QA system 210 has found a possible answer: Samuel Kane.Based on that information, the QA system 210 believes that Samuel Kaneis an actor. Should the QA system 210 conclude that Samuel Kane is astar because it believes that Samuel Kane is an actor? Should the QAsystem 210 suspect that Samuel Kane is a star because it believes thatSamuel Kane is an actor?

An additional example herein employs question expansion to generatefollow-on inquires, as follows:

-   Question: “This television quiz show host from Mexico was born in    the last few days of the 60's.”

In this example, the question is looking for several specific elements.An appropriate answer to the question is “Marco Antonio Regil.”According to an embodiment herein, the QA system 210 uses the questiontext to build a keyword search question. The QA system 210 considersmany possible expansions of the question; e.g., show is related toprogram, born is related to birth, television is related to TV, etc.There is a relevant piece of evidence or passage that states, “MarcoAntonio Regil (born Dec. 27, 1969) is a Mexican television personalityand game show host, born in Tijuana, Mexico.” An embodiment herein canfind this relevant passage since it contains the correct answer and somekeywords from the clue. The QA system 210 can then look for expansionsto the original question that would have produced this passage. The QAsystem 210 finds several expansions: quiz show is related to game show,Mexico is related to Mexican, 60's is related to 1969. These could beused as training data to train a more effective question expander ifthey are actually implied by the clue. Thus, this embodiment can enablethe following follow-on inquiries:

If a question is requesting information about a quiz show, can I becertain that information about a game show is relevant?

If a question is requesting information about a quiz show, should I besuspect that information about a game show is relevant?

If a question is requesting information about the 60's, can I be certainthat information about 1969 is relevant?

If a question is requesting information about the 60's, should I becertain that information about a 1969 is relevant?

Additionally, the QA system 210 can present follow-on inquiries topotential respondents 270 (internet users, other QA system, etc.)requesting potential respondents 270 to rank the terms of the questionfrom most relevant to the topic of this question to least relevant:

-   -   a) game show    -   b) birth    -   c) Mexican    -   d) program    -   e) TV    -   f) 1969

The QA system 210 can present follow-on inquiries to potentialrespondents 270 (internet users, other QA system, etc.) requestingpotential respondents 270 to rate the terms on a number scale based onrelevance to the topic of this question, for example rate from 1-10 interm of relevance the following terms:

-   -   a) game show    -   b) birth    -   c) Mexican    -   d) program    -   e) TV    -   f) 1969

Another embodiment herein employs relation detection for knowledge-basedlookup to generate follow-on inquiries. An example of relation detectionis described below:

-   Question: “She appeared on the big screen as Rachel Powell in the    Hollywood blockbuster K-PAX.”

In this example, the question is looking for a specific person's name.An appropriate answer to the question is “Mary McCormack.” According toan embodiment herein, the QA system 210 may use the question text and astructured knowledge base to arrive at the correct answer. Thestructured knowledge base may have an entry labeled Role_In_Movie (“MaryMcCormack”, “Rachel Powell,” “K-PAX”). Thus, the QA system 210 wouldhave gotten the answer correct had it concluded that the question wasasking for the specific variable that satisfies the following structuredquestion: played_Role_In_Movie (???, “Rachel Powell,” “K-PAX”). However,the system does not have a detection pattern for Played_Role_In_Moviethat matches this clue. It does have a pattern for Played_Role_In_Movie(X,Y,Z) of the form “X played Y in Z”. Thus, it knows that if the texthad been “She played Rachel Powell in K-PAX,” then it would havedetected this relation and gotten the question right; and as such, itcould present a context like this to the respondents 270. However, theQA system 210 did not fully understand the following question: “Sheappeared on the big screen as Rachel Powell in the Hollywood blockbusterK-PAX.” It thinks it may be able to simplify this question into thefollowing question: “She played Rachel Powell in K-PAX.” It could thenpresent one or more of the following questions given this context: Isthis a valid simplification? Can you imagine a correct answer to thesimplified form that is not a correct answer to the original question?Can you imagine a correct answer to the original question that is not acorrect answer to the simplified form? The original question uses theword “Hollywood” but the simplified form does not. Is “Hollywood”implied in the simplified form? The original question uses the word“Hollywood” but the simplified form does not. Is “Hollywood” a veryimportant concept for answering this question? Please rank the followingterms from the original clue that are not in the simplified form as mostimportant to least important in terms of what was lost in simplifying:

-   -   a) appeared    -   b) big    -   c) screen    -   d) Hollywood    -   e) blockbuster

To compute the confidence scores for the candidate answers, the QAsystem 210 uses several criteria: e.g., is the candidate answersatisfying similar lexical and semantic relations (e.g. for a queryabout an actress starring in a movie, is the candidate answer a female,and does the candidate answer satisfy actor-in-movie relation?); howwell does the candidate answer and the query align; how well do theterms match, and do the terms exist in similar order.

The language and knowledge processing infrastructure of the QA system210 combines statistical and heuristic techniques to assess its ownknowledge and produce its best candidate answer 285 with an accurateconfidence—a measure of the likelihood it is correct based on aself-assessment of its sources, inference methods, and priorperformance. Unfortunately, some of the respondents 270 on the Internet275 may provide incorrect or misleading result. As noted above, thesystems and methods herein can utilize a crowd sourcing technique toobtain responses to the follow-on inquiries 260. Crowd sourcing involvesasking external sources 272, such as an expert community, which can bemany human Internet/computer users, to provide individual responses tothe follow-on inquiries. Understandably, some of the responses from somerespondents 270 will be incorrect. However, the “correctness” of theresponse is a relative statement because, as shown above in the examplesin FIG. 5, some semantic and grammatical human ways of asking andanswering questions may not be purely logically “correct.” Therefore,what is considered a strong synonym by humans may not be considered asynonym under purely logical rules. Thus, the systems and methods hereinlearn what the majority of the respondents 270 consider to be the“correct” responses to the follow-on inquiries and use such as astandard of what is “correct” in order to become more humanlike.

Further, what is considered to be the “correct” response to a follow-oninquiry 260 can vary from geographic place to place, depending uponlocal language usage of humans in the different areas. Therefore, thesystems and methods herein not only learn from the majority of allrespondents as to what is “correct”, but also learn what the “correct”response is for each of the different geographic areas in which the QAsystem 210 may be utilized. That is, what may be considered a very closesynonym is North America may not be considered a synonym at all inIreland despite sharing the common English language. The systems andmethods herein, therefore, learn what humans consider to be the correctanswer component on many different levels.

According to embodiments herein, the QA system 210 can employ manyalternative approaches to validating responses, three examples of whichare illustrated in FIGS. 6-8. More specifically, FIGS. 6, 7, and 8illustrate verification schemes that may be employed to determine theaccuracy of the information submitted to the QA system 210.

According to FIG. 6, one embodiment herein permits only outside sourceswith demonstrated accuracy in providing previous information 645, suchas correct answers 285 to previous questions, to provide answers tofollow-on inquiries 260. The QA system 210 can fill-in and classifymarginal evidence, and supply answers generally. In the embodiment ofFIG. 6, the same follow-on inquiry 260 is given to multiple potentialrespondents 270. If several respondents 270 provide the same result, theQA system 210 can generally trust the answer.

According to FIG. 7, one embodiment herein permits outside sources tosupply answers that match with the results the QA system 210 hasdetermined regarding all the good and bad evidence 775. Using resultsthat the QA system 210 has already marked as “good” 750 or “bad” 760,the QA system 210 can generally trust the respondents 270 that agreewith the QA system 210 on the good evidence and bad evidence 775. Sincethe accuracy of the QA system 210 regarding the classifications of thegood evidence 750 and the bad evidence 760 is beyond dispute, it can beimplied that respondents 270 that agree with these assessments are morelikely than not to be accurate in order to provide responses tofollow-on inquiries 780 that match with its own assessments, can fill-inand classify marginal evidence, and supply responses generally.

According to FIG. 8, one embodiment herein permits specificallyidentified outside sources to supply answers 885 if they demonstrate asufficient amount of accuracy in relation to a known trusted source 880,such as a human expert. According to this embodiment, a trusted source880 provides a response for several examples. The QA system 210 cangenerally trust respondents 270 that often agree with the trusted source880 on those examples.

Accuracy in any of the embodiments can be determined by tracking thesuccess rate of the outside source or sources 272 in relation to themetrics and comparisons indicated in the above embodiments. This can bedone by keeping track of responses in relation to the above embodimentsfor definite duration or an infinite duration, and it can be doneinternally by the QA system 210, or by another tracking system or methodknown in the art.

Furthermore, the embodiments disclosed in FIGS. 6, 7, and 8 can beemployed individually, or they can be combined in any manner. In someembodiments, the verification techniques illustrated in FIGS. 6, 7, and8 can be used in hybrid verification schemes by mixing the features ofportions of each method. For example, the verification scheme shown inFIG. 8 can compare the determinations of the reliable trusted source 880on good and bad evidence 775 to those of the other respondents 270 todetermine which outside source will be permitted to respond to the QAsystem 210. Similarly, the verification scheme can be a combinationmetric of answers provided to questions, answers to follow-on inquiries,and/or classifications regarding evidence provided by the reliableoutside source in relation to the answers provided to questions, answersto follow-on inquiries, and/or classifications regarding evidenceprovided by the other outside sources. Furthermore, the verificationscheme can combine any of the methods in any manner in order to generatea hybrid metric that permits the outside source to respond to the QAsystem 210.

All of the above schemes and methods enable a QA system to expand itsknowledge base as can be contained in a database, correct its processesfor future questions using external sources, such as human users on anetwork or other QA systems, and generally increase its accuracy. Asshould be apparent, the external or outside sources referencedthroughout this application, whether explicitly stated or not, include,but are not limited to, sources accessible to the QA system 210 such ashuman sources, electronic sources, such as another QA system, or anyother source accessible by the QA system 210 that is capable tofacilitate responses to the QA system 210 according to the principlesembodied in the present disclosure.

The QA system 210 described herein is an application of advanced naturallanguage processing, information retrieval, knowledge representation andreasoning, and machine learning technologies to the field of open-domainquestion answering using technology for hypothesis generation, massiveevidence gathering, analysis, and scoring. In addition to identifying ananswer 285, the QA system 210 described herein determines a confidencein its answer 285. If that confidence is low, the system realizes thatit may not understand the question. For example, there may be a pun orsome syntax that the QA system 210 does not understand. On the otherhand, if the confidence is high, the QA system 210 knows it likelyunderstood the question and stands a good chance of getting thatquestion right. Having sufficient information, the QA system 210 canalways find documents or passages containing some keywords in commonwith the question but may lack the precision, depth, and understandingnecessary to deliver correct answers with accurate confidence.

An additional embodiment herein uses the failures of the QA system togenerate questions to determine whether the outside responding source isa human source. As described above, the failures of the QA systemmotivate follow-on inquiries to employ common sense knowledge and crowdsourcing techniques in order to respond to those follow-on inquiries.According to another embodiment, the QA system can evaluate the natureof the responses received and determine that a respondent is human.

FIG. 9 is a flow diagram illustrating one non-limiting embodimentherein. According to FIG. 9, one embodiment described herein enables acomputer implemented method for validating that a user is human.

The first step of the method, at 910, involves providing a questionanswering system. The QA system uses question answering processes andevidence passages to provide responses to questions.

In the next step of the method, at 920, a question is input to the QAsystem. The question may be in the form of a statement or an answerseeking an appropriate question.

Next, at 930, a failure in a question answering process is determined.The QA system generates one or more candidate answers to the questionwith associated confidence scores based on results from scoringprocesses/algorithms for pieces of evidence extracted from a corpus ofdata. A failure may result either from an inability to generate acandidate answer with a confidence score above a threshold value or ifthe QA system cannot correctly interpret the question. Additionally, afailure may also result from an individual piece of evidence receiving ascore below a threshold value.

At 940, the failure is used to determine a missing piece of information.The missing piece of data/information may be data/information that wouldenable the QA system to improve a score for a piece of evidence, forexample a passage, wherein the score for the piece of evidence is usedin a confidence score for a candidate answer. The missing informationmay be any item of data, a fact, a syntactical relationship, agrammatical relationship, a logical rule, a taxonomy rule, a grammaticalrule, or any other information that would increase a determined scorefor a piece of evidence that may support or refute a candidate answer tothe question.

Next, at 950, an inquiry is output to outside sources in order to obtainthe missing piece of data. The inquiry may be directed to outsidesources that can include a variety of users who can be human users orelectronic systems capable of providing a response, such as other QAsystems. A follow-on inquiry may involve, for example, keyword matching,expansion of the original question, and/or a request for lexicalsemantic relationships. For example, the QA system might request aclarification of what sense a word is being used, or what type ofinformation is being requested. In some embodiments, the follow-oninquiry may ask a question for which the answer is known by the QAsystem. In some embodiments, the follow-on inquiry may ask a questionfor which the answer is not known by the QA system.

At 960, after receiving a response from the responding source, the QAsystem evaluates the nature of the response to determine whether theoutside or external responding source is human 970.

The QA system 210 can do this by keeping track of how other humans haveresponded to questions or follow-on inquiries, and it can compare aresponse from an external source to the tracked responses. Responsesthat agree with responses from other humans can indicate that arespondent is human. These can be the same, similar, or differentfollow-on inquiries and questions. If the response matches well, the QAsystem can determine that the respondent was human. Afterwards, the QAsystem may determine the answers to questions for which the answer isnot already known based on consensus among users who were previouslyknown to be human.

Another method that the QA system can use to determine if the respondingsource is human is to phrase the question or follow-on inquiry innatural language fashion. The QA system can then assess the accuracy ofthe response and/or determine whether or not the response itself is innatural language format. To determine if a response is in naturallanguage format the system can perform several types of analyses. In onenon-limiting example, the QA system 210 can perform a syntactic analysisby parsing the answer to determine if the answer conformation tosyntactic rules of a language, for example English. Similarly, the QAsystem 210 can check to see if the answer is semantically similar bycomparing the predicate argument structure to a collection of humanproduced text. For example, the phrase “A quasar can star in a honormovie” is syntactically well formed but the predicate “star” neveroccurs together with “quasar” and “honor movie.” This can be checked ina lexical database such as Prismatic, as described in J. Fan et al.,“Prismatic: Introducing Knowledge from a Large Scale Lexical RelationResource,” Proceedings of the NAACL HLT 2010 First InternationalWorkshop on Formalisms and Methodology by Reading, pages 122-127, June2010. In some embodiments, the QA system can use general questions,natural language or otherwise, that humans have a known particular wayof responding. Using such general questions, the QA system can determinewhether the responding source is human. These descriptions are merelyexemplary in nature. It is contemplated that those skilled in the artcan conceive and apply additional methods and techniques that willachieve the same result in accordance with the present disclosure andits principles.

The present invention applies to all QA systems now known and as may bedeveloped in the art, and can be implemented by computer or electronicdevices now know (or that may become known) in the art.

According to another embodiment herein, a question answering (QA) systemincludes a processor, an evidence analysis module connected to theprocessor, a first interface connected to the processor and a secondinterface connected to the processor, and a corpus of data connected tothe evidence analysis module. The QA system is the type capable ofproducing evidence in support of a candidate answer. A first question tobe answered by the QA system is provided through the first interface.The processor creates a collection of candidate answers to the firstquestion from the corpus of data. Each candidate answer has supportingevidence and a score generated by the processor. The evidence analysismodule produces a second question (i.e. follow-up inquiry) based on thesupporting evidence. The processor presents the second question throughthe second interface to one or more users. The processor receives atleast one response or knowledge item from one or more users through thesecond interface. The processor validates the at least one response anddetermines whether a user is human based on comparing the at least oneresponse or knowledge item with other answers provided to the secondquestion. In some embodiments, the processor inputs validated responsesinto the corpus of data to improve the ability of the QA system toanswer questions. The new response or knowledge item changes thesupporting evidence for the first question and the scores for thecandidate answers. Based on the changed evidence, the QA systemgenerates an answer to the first question through the first interface.The system can be scaled up or down depending on different applicationrequirements.

According to a further embodiment herein, an article of manufacture isprovided that includes a computer readable medium having computerreadable instructions embodied therein for performing the steps of thecomputer implemented methods, including but not limited to the methodsillustrated in FIGS. 1 and 9. Any combination of one or more computerreadable non-transitory medium(s) may be utilized. The computer readablemedium may be a computer readable signal medium or a computer readablestorage medium. The non-transitory computer storage medium storesinstructions, and a processor executes the instructions to perform themethods described herein. A computer readable storage medium may be, forexample, but not limited to, an electronic, magnetic, optical,electromagnetic, infrared, or semiconductor system, apparatus, ordevice, or any suitable combination of the foregoing. FIG. 10illustrates exemplary articles of manufacture, such as, a magneticstorage device 560, a portable compact disc read-only memory (CD-ROM)570, and a “plug-and-play” memory device 580, like a USB flash drive.Any of these devices may have computer readable instructions forcarrying out the steps of the methods described above with reference toFIGS. 1 and 9.

Aspects of the present disclosure are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems) and computer program products according to variousembodiments. It will be understood that each block of the flowchartillustrations and/or two-dimensional block diagrams, and combinations ofblocks in the flowchart illustrations and/or block diagrams, can beimplemented by computer program instructions. The computer programinstructions may be provided to a processor of a general purposecomputer, special purpose computer, or other programmable dataprocessing apparatus to produce a machine, such that the instructions,which execute via the processor of the computer or other programmabledata processing apparatus, create means for implementing thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The computer program instructions may be stored in a computer readablemedium that can direct a computer, other programmable data processingapparatus, or other devices to function in a particular manner, suchthat the instructions stored in the computer readable medium produce anarticle of manufacture including instructions which implement thefunction/act specified in the flowchart and/or block diagram block orblocks.

Furthermore, the computer program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other devicesto cause a series of operational steps to be performed on the computer,other programmable apparatus or other devices to produce a computerimplemented process such that the instructions which execute on thecomputer or other programmable apparatus provide processes forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks.

In case of implementing the embodiments herein by software and/orfirmware, a program constituting the software may be installed into acomputer with dedicated hardware, from a storage medium or a network,and the computer is capable of performing various functions if withvarious programs installed therein.

A representative hardware environment for practicing the embodimentsherein is depicted in FIG. 11. This schematic drawing illustrates ahardware configuration of an information handling/computer system inaccordance with the embodiments herein. The system comprises at leastone processor or central processing unit (CPU) 610. The CPUs 610 areinterconnected via system bus 612 to various devices such as a randomaccess memory (RAM) 614, read-only memory (ROM) 616, and an input/output(I/O) adapter 618. The I/O adapter 618 can connect to peripheraldevices, such as disk units 611 and tape drives 613, or other programstorage devices that are readable by the system. The system can read theinventive instructions on the program storage devices and follow theseinstructions to execute the methodology of the embodiments herein.

In FIG. 11, CPUs 610 perform various processing based on a programstored in a Read Only Memory (ROM) 616 or a program loaded from aperipheral device, such as disk units 611 and tape drives 613 to aRandom Access Memory (RAM) 614. In the RAM 614, required data when theCPU 610 performs the various processing or the like is also stored asnecessary. The CPU 610, the ROM 616, and the RAM 614 are connected toone another via a bus 612. An input/output adapter 618 is also connectedto the bus 612 to provide an input/output interface, as necessary. Aremovable medium, such as a magnetic disk, an optical disk, amagneto-optical disk, a semiconductor memory, or the like, is installedon the peripheral device, as necessary, so that a computer program readtherefrom may be installed into the RAM 614, as necessary.

The system further includes a user interface adapter 619 that connects akeyboard 615, mouse 617, speaker 624, microphone 622, and/or other userinterface devices such as a touch screen device (not shown) to the bus612 to gather user input. Additionally, a communication adapter 620including a network interface card such as a LAN card, a modem, or thelike connects the bus 612 to a data processing network 625. Thecommunication adapter 620 performs communication processing via anetwork such as the Internet. A display adapter 621 connects the bus 612to a display device 623, which may be embodied as an output device suchas a monitor (such as a Cathode Ray Tube (CRT), a Liquid Crystal Display(LCD), or the like), printer, or transmitter, for example.

In the case where the above-described series of processing isimplemented with software, the program that constitutes the software maybe installed from a network such as the Internet or a storage mediumsuch as the removable medium.

Those skilled in the art would appreciate that, the storage medium isnot limited to the peripheral device having the program stored thereinas illustrated in FIG. 11, which is distributed separately from thedevice for providing the program to the user. Examples of a removablemedium include a magnetic disk (including a floppy disk), an opticaldisk (including a Compact Disk-Read Only Memory (CD-ROM) and a DigitalVersatile Disk (DVD)), a magneto-optical disk (including a Mini-Disk(MD) (registered trademark)), and a semiconductor memory. Alternatively,the storage medium may be the ROM 616, a hard disk contained in thestorage section 611, or the like, which has the program stored thereinand is distributed to the user together with the device that containsthem.

As will be appreciated by one skilled in the art, aspects of the systemsand methods herein may be embodied as a system, method, or computerprogram product. Accordingly, aspects of the present disclosure may takethe form of an entirely hardware embodiment, an entirely softwareembodiment (including firmware, resident software, micro-code, etc.) oran embodiment combining software and hardware aspects that may allgenerally be referred to herein as a “circuit,” “module” or “system.”Furthermore, aspects of the present disclosure may take the form of acomputer program product embodied in one or more computer readablemedium(s) having computer readable program code embodied thereon.

Any combination of one or more computer readable non-transitorymedium(s) may be utilized. The computer readable medium may be acomputer readable signal medium or a computer readable storage medium.The non-transitory computer storage medium stores instructions, and aprocessor executes the instructions to perform the methods describedherein. A computer readable storage medium may be, for example, but notlimited to, an electronic, magnetic, optical, electromagnetic, infrared,or semiconductor system, apparatus, or device, or any suitablecombination of the foregoing. More specific examples (a non-exhaustivelist) of the computer readable storage medium include the following: anelectrical connection having one or more wires, a portable computerdiskette, a hard disk, a random access memory (RAM), a read-only memory(ROM), an erasable programmable read-only memory (EPROM or Flashmemory), an optical fiber, a magnetic storage device 560(FIG. 10), aportable compact disc read-only memory (CD-ROM) 570, an optical storagedevice, a “plug-and-play” memory device 580, like a USB flash drive, orany suitable combination of the foregoing. In the context of thisdocument, a computer readable storage medium may be any tangible mediumthat can contain, or store a program for use by or in connection with aninstruction execution system, apparatus, or device.

A computer readable signal medium may include a propagated data signalwith computer readable program code embodied therein, for example, inbaseband or as part of a carrier wave. Such a propagated signal may takeany of a variety of forms, including, but not limited to,electro-magnetic, optical, or any suitable combination thereof. Acomputer readable signal medium may be any computer readable medium thatis not a computer readable storage medium and that can communicate,propagate, or transport a program for use by or in connection with aninstruction execution system, apparatus, or device.

Program code embodied on a computer readable medium may be transmittedusing any appropriate medium, including but not limited to wireless,wireline, optical fiber cable, RF, etc., or any suitable combination ofthe foregoing.

Computer program code for carrying out operations for aspects of thepresent disclosure may be written in any combination of one or moreprogramming languages, including an object oriented programming languagesuch as Java, Smalltalk, C++, or the like and conventional proceduralprogramming languages, such as the “C” programming language or similarprogramming languages. The program code may execute entirely on theuser's computer, partly on the user's computer, as a stand-alonesoftware package, partly on the user's computer and partly on a remotecomputer, or entirely on the remote computer or server. In the latterscenario, the remote computer may be connected to the user's computerthrough any type of network, including a local area network (LAN) or awide area network (WAN), or the connection may be made to an externalcomputer (for example, through the Internet using an Internet ServiceProvider).

The flowchart and block diagrams in the figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments herein. In this regard, each block in the flowchart or blockdiagrams may represent a module, segment, or portion of code, whichcomprises one or more executable instructions for implementing thespecified logical function(s). It should also be noted that, in somealternative implementations, the functions noted in the block may occurout of the order noted in the figures. For example, two blocks shown insuccession may, in fact, be executed substantially concurrently, or theblocks may sometimes be executed in the reverse order, depending uponthe functionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts, or combinations of special purpose hardware andcomputer instructions.

Deployment types include loading directly in the client, server, andproxy computers via loading a storage medium such as a CD, DVD, etc. Theprocess software may also be automatically or semi-automaticallydeployed into a computer system by sending the process software to acentral server or a group of central servers. The process software isthen downloaded into the client computers that will execute the processsoftware. The process software is sent directly to the client system viae-mail. The process software is then either detached to a directory orloaded into a directory by a button on the e-mail that executes aprogram that detaches the process software into a directory.Alternatively, the process software is sent directly to a directory onthe client computer hard drive. When there are proxy servers, theprocess will select the proxy server code, determine on which computersto place the proxy servers' code, transmit the proxy server code, andthen install the proxy server code on the proxy computer. The processsoftware will be transmitted to the proxy server, and then stored on theproxy server.

While it is understood that the process software may be deployed bymanually loading directly in the client, server, and proxy computers vialoading a storage medium such as a CD, DVD, etc., the process softwaremay also be automatically or semi-automatically deployed into a computersystem by sending the process software to a central server or a group ofcentral servers. The process software is then downloaded into the clientcomputers that will execute the process software. Alternatively, theprocess software is sent directly to the client system via e-mail. Theprocess software is then either detached to a directory or loaded into adirectory by a button on the e-mail that executes a program thatdetaches the process software into a directory. Another alternative isto send the process software directly to a directory on the clientcomputer hard drive. When there are proxy servers, the process willselect the proxy server code, determine on which computers to place theproxy servers' code, transmit the proxy server code, and then installthe proxy server code on the proxy computer. The process software willbe transmitted to the proxy server, and then stored on the proxy server.

In FIG. 12, step 700 begins the deployment of the process software. Thefirst thing is to determine if there are any programs that will resideon a server or servers when the process software is executed 701. Ifthis is the case, then the servers that will contain the executables areidentified 809. The process software for the server or servers istransferred directly to the servers' storage via FTP or some otherprotocol, or by copying through the use of a shared file system 810. Theprocess software is then installed on the servers 811.

Next, a determination is made on whether the process software is to bedeployed by having users access the process software on a server orservers 702. If the users are to access the process software on servers,then the server addresses that will store the process software areidentified 703.

A determination is made if a proxy server is to be built 800 to storethe process software. A proxy server is a server that sits between aclient application, such as a Web browser, and a real server. Itintercepts all requests to the real server to see if it can fulfill therequests itself. If not, it forwards the request to the real server. Thetwo primary benefits of a proxy server are to improve performance and tofilter requests. If a proxy server is required, then the proxy server isinstalled 801. The process software is sent to the servers either via aprotocol such as FTP or it is copied directly from the source files tothe server files via file sharing 802. Another embodiment would be tosend a transaction to the servers that contain the process software andhave the server process the transaction, then receive and copy theprocess software to the server's file system. Once the process softwareis stored at the servers, the users, via their client computers, thenaccess the process software on the servers and copy it to their clientcomputers file systems 803. Another embodiment is to have the serversautomatically copy the process software to each client and then run theinstallation program for the process software at each client computer.The users execute the program that installs the process software ontheir client computer 812, and then exit the process 708.

In step 704, a determination is made whether the process software is tobe deployed by sending the process software to users via e-mail. The setof users where the process software will be deployed are identifiedtogether with the addresses of the user client computers 705. Theprocess software is sent via e-mail 804 to each of the users' clientcomputers. The users receive the e-mail 805 and then detach the processsoftware from the e-mail to a directory on their client computers 806.The users execute the program that installs the process software ontheir client computer 812, and then exit the process 708.

Lastly, a determination is made on whether to the process software willbe sent directly to user directories on their client computers 706. Ifso, the user directories are identified 707. The process software istransferred directly to the users' client computer directory 807. Thiscan be done in several ways such as but not limited to sharing of thefile system directories and then copying from the sender's file systemto the recipient users' file system or alternatively using a transferprotocol such as File Transfer Protocol (FTP). The users access thedirectories on their client file systems in preparation for installingthe process software 808. The users execute the program that installsthe process software on their client computer 812, and then exit theprocess 708.

The process software is integrated into a client, server, and networkenvironment by providing for the process software to coexist withapplications, operating systems, and network operating systems software,and then installing the process software on the clients and servers inthe environment where the process software will function.

The first step is to identify any software on the clients and serversincluding the network operating system where the process software willbe deployed that are required by the process software or that work inconjunction with the process software. This includes the networkoperating system that is software that enhances a basic operating systemby adding networking features.

Next, the software applications and version numbers will be identifiedand compared to the list of software applications and version numbersthat have been tested to work with the process software. Those softwareapplications that are missing or that do not match the correct versionwill be upgraded with the correct version numbers. Program instructionsthat pass parameters from the process software to the softwareapplications will be checked to ensure the parameter lists match theparameter lists required by the process software. Conversely, parameterspassed by the software applications to the process software will bechecked to ensure the parameters match the parameters required by theprocess software. The client and server operating systems including thenetwork operating systems will be identified and compared to the list ofoperating systems, version numbers, and network software that have beentested to work with the process software. Those operating systems,version numbers, and network software that do not match the list oftested operating systems and version numbers will be upgraded on theclients and servers to the required level.

After ensuring that the software, where the process software is to bedeployed, is at the correct version level that has been tested to workwith the process software, the integration is completed by installingthe process software on the clients and servers.

In FIG. 13, step 820 begins the integration of the process software. Thefirst thing is to determine if there are any process software programsthat will execute on a server or servers 821. If this is not the case,then integration proceeds to 827. If this is the case, then the serveraddresses are identified 822. The servers are checked to see if theycontain software that includes the operating system (OS), applications,and network operating systems (NOS), together with their versionnumbers, that have been tested with the process software 823. Theservers are also checked to determine if there is any missing softwarethat is required by the process software 823.

A determination is made if the version numbers match the version numbersof OS, applications, and NOS that have been tested with the processsoftware 824. If all of the versions match and there is no missingrequired software, the integration continues in 827.

If one or more of the version numbers do not match, then the unmatchedversions are updated on the server or servers with the correct versions825. Additionally, if there is missing required software, then it isupdated on the server or servers 825. The server integration iscompleted by installing the process software 826.

Step 827, which follows either step 821, 824, or 826, determines ifthere are any programs of the process software that will execute on theclients. If no process software programs execute on the clients, theintegration proceeds to 830 and exits. If this not the case, then theclient addresses are identified 828.

The clients are checked to see if they contain software that includesthe operating system (OS), applications, and network operating systems(NOS), together with their version numbers, that have been tested withthe process software 829. The clients are also checked to determine ifthere is any missing software that is required by the process software829.

A determination is made as to whether the version numbers match theversion numbers of OS, applications, and NOS that have been tested withthe process software 831. If all of the versions match and there is nomissing required software, then the integration proceeds to 830 andexits.

If one or more of the version numbers do not match, then the unmatchedversions are updated on the clients with the correct versions 832. Inaddition, if there is missing required software then it is updated onthe clients 832. The client integration is completed by installing theprocess software on the clients 833. The integration proceeds to 830 andexits.

The process software can be stored on a shared file system accessiblefrom one or more servers. The process software is executed viatransactions that contain data and server processing requests that useCPU units on the accessed server. CPU units are units of time such asminutes, seconds, hours on the central processor of the server.Additionally, the assessed server may make requests of other serversthat require CPU units. CPU units are an example that represents onemeasurement of use. Other measurements of use include but are notlimited to network bandwidth, memory usage, storage usage, packettransfers, complete transactions etc. When multiple customers use thesame process software application, their transactions are differentiatedby the parameters included in the transactions that identify the uniquecustomer and the type of service for that customer. All of the CPU unitsand other measurements of use that are used for the services for eachcustomer are recorded. When the number of transactions to any one serverreaches a number that begins to affect the performance of that server,other servers are accessed to increase the capacity and to share theworkload. Likewise, when other measurements of use such as networkbandwidth, memory usage, storage usage, etc. approach a capacity so asto affect performance, additional network bandwidth, memory usage,storage etc. are added to share the workload. The measurements of useused for each service and customer are sent to a collecting server thatsums the measurements of use for each customer for each service that wasprocessed anywhere in the network of servers that provide the sharedexecution of the process software. The summed measurements of use unitsare periodically multiplied by unit costs and the resulting totalprocess software application service costs are alternatively sent to thecustomer and/or indicated on a web site accessed by the customer, whichthen remits payment to the service provider. In another embodiment, theservice provider requests payment directly from a customer account at abanking or financial institution. In another embodiment, if the serviceprovider is also a customer of the customer that uses the processsoftware application, the payment owed to the service provider isreconciled to the payment owed by the service provider to minimize thetransfer of payments.

The process software is shared, simultaneously serving multiplecustomers in a flexible, automated fashion. It is standardized,requiring little customization, and it is scalable, providing capacityon demand in a pay-as-you-go model.

The process software can be stored on a shared file system accessiblefrom one or more servers. The process software is executed viatransactions that contain data and server processing requests that useCPU units on the accessed server. CPU units are units of time such asminutes, seconds, hours on the central processor of the server.Additionally, the assessed server may make requests of other serversthat require CPU units. CPU units are an example that represents onemeasurement of use. Other measurements of use include but are notlimited to network bandwidth, memory usage, storage usage, packettransfers, complete transactions etc.

When multiple customers use the same process software application, theirtransactions are differentiated by the parameters included in thetransactions that identify the unique customer and the type of servicefor that customer. All of the CPU units and other measurements of usethat are used for the services for each customer are recorded. When thenumber of transactions to any one server reaches a number that begins toaffect the performance of that server, other servers are accessed toincrease the capacity and to share the workload. Likewise, when othermeasurements of use such as network bandwidth, memory usage, storageusage, etc. approach a capacity so as to affect performance, additionalnetwork bandwidth, memory usage, storage etc. are added to share theworkload.

The measurements of use used for each service and customer are sent to acollecting server that sums the measurements of use for each customerfor each service that was processed anywhere in the network of serversthat provide the shared execution of the process software. The summedmeasurements of use units are periodically multiplied by unit costs andthe resulting total process software application service costs arealternatively sent to the customer and/or indicated on a web siteaccessed by the customer, which then remits payment to the serviceprovider.

In another embodiment, the service provider requests payment directlyfrom a customer account at a banking or financial institution.

In another embodiment, if the service provider is also a customer of thecustomer that uses the process software application, the payment owed tothe service provider is reconciled to the payment owed by the serviceprovider to minimize the transfer of payments.

In FIG. 14, step 840 begins the On Demand process. A transaction iscreated that contains the unique customer identification, the requestedservice type, and any service parameters that further specify the typeof service 841. The transaction is then sent to the main server 842. Inan On Demand environment, the main server can initially be the onlyserver, then, as capacity is consumed, other servers are added to the OnDemand environment.

The server central processing unit (CPU) capacities in the On Demandenvironment are queried 843. The CPU requirement of the transaction isestimated, then the servers' available CPU capacity in the On Demandenvironment are compared to the transaction CPU requirement to see ifthere is sufficient CPU capacity available in any server to process thetransaction 844. If there is not sufficient server CPU capacityavailable, then additional server CPU capacity is allocated to processthe transaction 848. If there was already sufficient CPU capacityavailable, then the transaction is sent to a selected server 845.

Before executing the transaction, a check is made of the remaining OnDemand environment to determine if the environment has sufficientavailable capacity for processing the transaction. This environmentcapacity consists of such things as, but not limited to, networkbandwidth, processor memory, storage etc. 846. If there is notsufficient available capacity, then capacity will be added to the OnDemand environment 847. Next, the required software to process thetransaction is accessed, loaded into memory, then the transaction isexecuted 849.

The usage measurements are recorded 850. The usage measurements consistof the portions of those functions in the On Demand environment that areused to process the transaction. The usage of such functions as, but notlimited to, network bandwidth, processor memory, storage and CPU cyclesare what is recorded. The usage measurements are summed, multiplied byunit costs, and then recorded as a charge to the requesting customer851. If the customer has requested that the On Demand costs be posted toa web site 852, then they are posted 853.

If the customer has requested that the On Demand costs be sent viae-mail to a customer address 854, then they are sent 855. If thecustomer has requested that the On Demand costs be paid directly from acustomer account 856, then payment is received directly from thecustomer account 857. The last step is to exit the On Demand process858.

The process software may be deployed, accessed and executed through theuse of a virtual private network (VPN), which is any combination oftechnologies that can be used to secure a connection through anotherwise unsecured or untrusted network. The use of VPNs is to improvesecurity and for reduced operational costs. The VPN makes use of apublic network, usually the Internet, to connect remote sites or userstogether. Instead of using a dedicated, real-world connection such asleased line, the VPN uses “virtual” connections routed through theInternet from the company's private network to the remote site oremployee.

The process software may be deployed, accessed, and executed througheither a remote-access or a site-to-site VPN. When using theremote-access VPNs, the process software is deployed, accessed, andexecuted via the secure, encrypted connections between a company'sprivate network and remote users through a third-party service provider.The enterprise service provider (ESP) sets a network access server (NAS)and provides the remote users with desktop client software for theircomputers. The telecommuters can then dial a toll-free number or attachdirectly via a cable or DSL modem to reach the NAS and use their VPNclient software to access the corporate network and to access, download,and execute the process software.

When using the site-to-site VPN, the process software is deployed,accessed, and executed through the use of dedicated equipment andlarge-scale encryption, which are used to connect a company's multiplefixed sites over a public network, such as the Internet.

The process software is transported over the VPN via tunneling, which isthe process of placing an entire packet within another packet andsending it over a network. The protocol of the outer packet isunderstood by the network and both points, called tunnel interfaces,where the packet enters and exits the network.

In FIGS. 15 and 16, step 860 begins the Virtual Private Network (VPN)process. A determination is made to see if a VPN for remote access isrequired 861. If it is not required, then proceed to 862. If it isrequired, then determine if the remote access VPN exists 864.

If it does exist, then proceed to 865. Otherwise, identify the thirdparty provider that will provide the secure, encrypted connectionsbetween the company's private network and the company's remote users876. The company's remote users are identified 877. The third partyprovider then sets up a network access server (NAS) 878 that allows theremote users to dial a toll-free number or attach directly via a cableor DSL modem to access, download, and install the desktop clientsoftware for the remote-access VPN 879.

After the remote access VPN has been built, or if it been previouslyinstalled, the remote users can then access the process software bydialing into the NAS or attaching directly via a cable or DSL modem intothe NAS 865. This allows entry into the corporate network where theprocess software is accessed 866. The process software is transported tothe remote users' desktop over the network via tunneling. That is, theprocess software is divided into packets and each packet, including thedata and protocol, is placed within another packet 867. When the processsoftware arrives at the remote users' desktop, it is removed from thepackets, reconstituted, and executed on the remote users' desktop 868.

A determination is made to see if a VPN for site-to-site access isrequired 862. If it is not required, then proceed to exit the process863. Otherwise, determine if the site-to-site VPN exists 869. If it doesexist, then proceed to 872. Otherwise, install the dedicated equipmentrequired to establish a site-to-site VPN 870. Then build the large-scaleencryption into the VPN 871.

After the site-to-site VPN has been built, or if it had been previouslyestablished, the users access the process software via the VPN 872. Theprocess software is transported to the site users over the network viatunneling 873. That is, the process software is divided into packets andeach packet, including the data and protocol, is placed within anotherpacket 874. When the process software arrives at the remote users'desktop, it is removed from the packets, reconstituted, and executed onthe site users' desktop 875. Proceed to exit the process 863.

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting of this disclosure.As used herein, the singular forms “a”, “an”, and “the” are intended toinclude the plural forms as well, unless the context clearly indicatesotherwise. It will be further understood that the terms “comprises”and/or “comprising,” when used in this specification, specify thepresence of stated features, integers, steps, operations, elements,and/or components, but do not preclude the presence or addition of oneor more other features, integers, steps, operations, elements,components, and/or groups thereof.

The corresponding structures, materials, acts, and equivalents of allmeans or step plus function elements in the claims below are intended toinclude any structure, material, or act for performing the function incombination with other claimed elements as specifically claimed. Thedescriptions of the various embodiments herein have been presented forpurposes of illustration, but are not intended to be exhaustive orlimited to the embodiments disclosed. Many modifications and variationswill be apparent to those of ordinary skill in the art without departingfrom the scope and spirit of the described embodiments. The terminologyused herein was chosen to best explain the principles of theembodiments, the practical application or technical improvement overtechnologies found in the marketplace, or to enable others of ordinaryskill in the art to understand the embodiments disclosed herein.

What is claimed is:
 1. A device, comprising: a question-answer systemcomprising software for performing question answering processes; areceiver receiving a question into said question-answer system; aprocessor connected to said question-answer system, said processor:generating candidate answers to said question, determining a confidencescore for each of said candidate answers, evaluating sources of evidenceused to generate said candidate answers, identifying missing informationfrom a corpus of data, said missing information comprising anyinformation that improves the ability of said question-answer system tounderstand and evaluate said sources of evidence associated with saidcandidate answers, said identifying said missing information comprising:evaluating a piece of evidence and producing an evidence score accordingto a scoring process that reflects a degree to which said piece ofevidence supports or refutes said candidate answer, classifying saidpiece of evidence as marginal evidence if said score is below athreshold value, and evaluating said marginal evidence to identify saidmissing information, and generating a follow-on inquiry based on saidmissing information; and a network interface outputting said follow-oninquiry to external sources separate from said question-answer system toobtain responses to said follow-on inquiry and receiving from saidexternal sources a response to said follow-on inquiry, said processorinputting said response into said corpus of data.
 2. The deviceaccording to claim 1, said processor developing additional logical rulesand additional evidence for said question-answer system based on saidresponse to said follow-on inquiry.
 3. The device according to claim 1,in response to said processor inputting said response into saidquestion-answer system, said processor re-evaluating said confidencescore for said candidate answers.
 4. The device according to claim 1,further comprising: a parser, said parser: parsing said question into afirst collection of elements; and parsing said piece of evidence into asecond collection of elements; and said processor further analyzing saidfirst collection of elements for said question and said piece ofevidence in order to determine a relationship between a first element ofsaid first collection of elements for said question and a second elementof said second collection of elements for said piece of evidence; andlocating a missing relationship between said first element and saidsecond element, said missing relationship comprising said missinginformation.
 5. The device according to claim 1, said processor further:evaluating whether said confidence score for any of said candidateanswers is below a confidence threshold value; and if said confidencescore is below said confidence threshold value, triggering said step ofidentifying missing information.
 6. The device according to claim 1,said processor validating that said response is accurate prior to addingsaid response to said corpus of data, said validating comprising one of:identifying a most common response received from external sources andselecting said most common response; selecting responses from firstoutside sources, said first outside sources having a previouslyestablished level of accuracy in providing previous responses toprevious inquiries; when said question-answer system comprises a totalamount of evidence, said total amount of evidence having been classifiedby said question-answer system as one of good evidence and bad evidence,said good evidence enabling said question-answer system to provide acorrect answer to said question, and said bad evidence being irrelevantto said question, selecting responses from second outside sources, saidsecond outside sources having a previously established level of accuracyin classifying said total amount of evidence as good evidence or badevidence that matches classification of said total amount of evidence bysaid question-answer system as good evidence and bad evidence; andselecting answers from third outside sources, said third outside sourceshaving demonstrated a previously established level of accuracy inproviding responses that match responses from a known source.
 7. Aquestion answering (QA) system comprising: a processor; an evidenceanalysis module connected to said processor; a first interface connectedto said processor; a second interface connected to said processor; and acorpus of data connected to said evidence analysis module, said corpusof data comprising a total amount of evidence, said evidence analysismodule classifying said total amount of evidence as good evidence ormarginal evidence, said first interface receiving a first question to beanswered by said QA system, said processor creating a collection ofcandidate answers to said first question from said corpus of data, eachsaid candidate answer having supporting evidence and a confidence scoregenerated by said processor, said good evidence having an evidence scoreabove a previously established evidence threshold value, and enablingsaid QA system to provide a candidate answer to said first question witha confidence score above a previously established confidence thresholdvalue, and said marginal evidence having an evidence score below saidpreviously established evidence threshold value, enabling said QA systemto provide a candidate answer to said first question with a confidencescore below said previously established confidence threshold value, andrequiring said QA system to obtain additional information to improvesaid confidence score for said candidate answer to said first question,said processor identifying missing information from said corpus of data,said missing information comprising any information that improves theability of said QA system to understand and evaluate said supportingevidence associated with said collection of candidate answers, saidevidence analysis module producing a second question based on saidmissing information, said processor presenting said second questionthrough said second interface to external sources separate from said QAsystem to obtain responses to said second question, said processorreceiving a response or knowledge item from said external sourcesthrough said second interface, said processor inputting said response orknowledge item into said corpus of data, and said processorautomatically developing additional logical rules and additionalevidence for said QA system based on said response or knowledge item. 8.The question answering system according to claim 7, further comprising:said evidence analysis module evaluating a piece of evidence relevant tosaid candidate answer to identify said missing information, said missinginformation comprising a data, a fact, a syntactical relationship, agrammatical relationship, a logical rule, or a taxonomy rule.
 9. Thequestion answering system according to claim 7, said corpus of datafurther comprising textual passages, said processor comparing said firstquestion to said textual passages to provide said collection ofcandidate answers.
 10. The question answering system according to claim7, said processor further evaluating said response by at least one of:identifying a most common response received from said external sourcesand selecting said most common response; selecting responses from firstoutside sources, said first outside sources having a previouslyestablished level of accuracy in providing previous responses toprevious inquiries; when said QA system comprises a total amount ofevidence, said total amount of evidence having been classified by saidQA system as one of good evidence and bad evidence, said good evidenceenabling said QA system to provide a correct answer to said question,and said bad evidence being irrelevant to said question, selectingresponses from second outside sources, said second outside sourceshaving a previously established level of accuracy in classifying saidtotal amount of evidence as good evidence or bad evidence that matchesclassification of said total amount of evidence by said QA system asgood evidence and bad evidence; and selecting answers from third outsidesources, said third outside sources having demonstrated a previouslyestablished level of accuracy in providing responses that matchresponses from a known source.
 11. The question answering systemaccording to claim 7, in response to inputting said response orknowledge item into said corpus of data, said processor re-evaluatingsaid confidence score for said collection of candidate answers.
 12. Amethod, comprising: generating candidate answer to a questiondetermining a confidence score for each of said candidate answers;evaluating sources of evidence used to generate said candidates answers;identifying missing information from a corpus of data, said missinginformation comprising any information that improves the ability of aquestion-answer system to understand and evaluate sources of evidenceassociated with a candidate answer; generating a follow-on inquiry, saidfollow-on inquiry prompting for said missing information to be provided;outputting said follow-on inquiry to an external source; receiving aresponse to said follow-on inquiry from said external source; and addingsaid response to said corpus of data.
 13. The method of claim 12, saidmissing information comprising a data, a fact, a syntacticalrelationship, a grammatical relationship, a logical rule, or a taxonomyrule.
 14. The method according to claim 12, said identifying missinginformation further comprising: evaluating a piece of evidence relevantto said candidate answer to identify said missing information, saidmissing information comprising a data, a fact, a syntacticalrelationship, a grammatical relationship, a logical rule, or a taxonomyrule.
 15. The method according to claim 14, said evaluating a piece ofevidence further comprising: parsing said question into a firstcollection of elements; parsing said piece of evidence into a secondcollection of elements; analyzing said first collection of elements forsaid question and said piece of evidence in order to determine arelationship between an element of said first collection of elements forsaid question and an element of said second collection of elements forsaid piece of evidence, said relationship between elements comprising alexical relationship, a grammatical relationship, or a semanticrelationship; and locating a missing relationship between said elements,said missing relationship comprising said missing information.
 16. Themethod of claim 15, said missing relationship between said first elementand said second element comprising a lexical relationship, a grammaticalrelationship, or a semantic relationship.
 17. The method according toclaim 12, further comprising: receiving said question into saidquestion-answer system; and generating said candidate answer to saidquestion using said question-answer system and said corpus of data. 18.The method according to claim 17, said identifying said missinginformation further comprising: evaluating a piece of evidence toproduce an evidence score according to a scoring process that reflects adegree to which said piece of evidence supports or refutes saidcandidate answer; classifying said piece of evidence as marginalevidence if said score is below a threshold value; and evaluating saidmarginal evidence to identify said missing information.
 19. The methodaccording to claim 12, further comprising: evaluating whether saidconfidence score is below a confidence threshold value; and if saidconfidence score is below said confidence threshold value, triggeringsaid step of identifying missing information.
 20. The method accordingto claim 12, further comprising: validating said response is accurateprior to adding said response to said corpus of data, said validatingcomprising one of: identifying a most common response received fromexternal sources and selecting said most common response; selectingresponses from first outside sources, said first outside sources havinga previously established level of accuracy in providing previousresponses to previous inquiries; when said question-answer systemcomprises a total amount of evidence, said total amount of evidencehaving been classified by said question-answer system as one of goodevidence and bad evidence, said good evidence enabling saidquestion-answer system to provide a correct answer to said question, andsaid bad evidence being irrelevant to said question, selecting responsesfrom second outside sources, said second outside sources having apreviously established level of accuracy in classifying said totalamount of evidence as good evidence or bad evidence that matchesclassification of said total amount of evidence by said question-answersystem as good evidence and bad evidence; and selecting answers fromthird outside sources, said third outside sources having demonstrated apreviously established level of accuracy in providing responses thatmatch responses from a known source.